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Abstract The simplified Lennard-Jones (LJ) potential minimization problem is 

N N 



minimize 



f{x) = 4^ ^ -g- 3- subject to X G 



= 1 j = lj<i \^^J ^^L 



wherer^j = {x^i-2-x^j-2Y ^{x^i-i-x^j-iY ^{x^i-x^jY , (3^3i-2, ^3i-i, ^3i) is the co- 
ordinates of atom i in R^, z, j = 1, 2, . . . , A^(> 2 integer), n — 3N and N is the whole 
number of atoms. The nonconvexity of the objective function and the huge number 
of local minima, which is growing exponentially with N, interest many mathematical 
'^ \ optimization experts. In this paper, the canonical dual theory elegantly tackles this 

problem illuminated by the amyloid fibril molecular model building. 
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1 Introduction 

Neutral atoms are subject to two distinct forces in the limit of large distance and short 
distance: a dispersion force (i.e. attractive van der Waals (vdw) force) at long ranges, 
and a repulsion force, the result of overlapping electron orbitals. The Lennard-Jones (L- 
J) potential represents this behavior (http://en.wikipedia.Qrg/wi ki/Lennard-jQnes-pQtentiaIl 
or [9j and references therein). The L-J potential is of the form 

V{r) = As \C-r - C-f] , (1) 

where r is the distance between two atoms, e is the depth of the potential well and a 
is the atom diameter; these parameters can be fitted to reproduce experimental data 
or deduced from results of accurate quantum chemistry calculations. The (^)^^ term 
describes repulsion and the (^)^ term describes attraction (Fig. [1]). In Fig. [T]we may 
see two points: (I) V{r) — (but the value of V{r) is not the minimal value) when 
r — a (i.e. the distance between two atoms equals to the sum of atom radii of the 
atoms); and (II) when r = 2^/^a (i.e. the distance between two atoms equals to the 
sum of vdw radii of the atoms), the value of V{r) reaches its minimal value —e (i.e. 
the bottom of the potential well; the force between the atoms is zero at this point). 
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Figure 1: The Lennard-Jone Potential (formulas ([T]) and (fT5]) ) (This Figure can be 

lOUnQ m WeDSllJe ^http://hornepage.rnac.corn/swain/CMC/DDResources/rnol_interactions/rnolecular_interactions.htrnl| 



This paper is written based on (II). If we introduce the coordinates of the atoms whose 
number is denoted by N and let £ = a = 1 be the reduced units, the form ^ becomes 

N N 
1=1 j=lj<i 

where Tij = {xsi-2 - xsj-2)'^ + {xsi-i - xsj. 

(x3^_2, xsi-i^xsi) is the coordinates of atom z, z,j = l,2,...,A^(>2). The minimization 

of L-J potential f{x) on R^ (where n = 3A^) is an optimization problem: 



-1)^ + (xsi - xsj)^ = \\Xi 



Xj\\2 and 



min fix) 



(3) 



This optimization problem interests many optimization experts, for example, Pardalos 
[lU], Xue [laillllSllISlE], Huang [HE] et al. 



For (|3]), when its global optimization solution is reached, the value r in ([T]) should be 
the sum of two vdw radii of the two atoms interacted. The three dimensional structure of 
a molecule with N atoms can be described by specifying the 3-Dimensional coordinate 
positions Xi, X2, . . . , X^^ G M^ of all its atoms. Given bond lengths r^j between a 
subset S of the atom pairs, the determination of the molecular structure is 



(Vo) to find Xi,X2,...,Xn s.t. \\Xi - Xj\\ = rij,{ij) e S, 



(4) 



where 1 1 • 1 1 denotes a norm in a real vector space and it is calculated as the Euclidean 
distance 2-norm in this paper. (|4]) can be reformulated as a mathematical global opti- 
mization problem (GOP) 



(V) minP(X) = E(.,,)e^^^.(ll^^ 



XjW 



r2.)2 



(5) 



in the terms of finding the global minimum of the function P{X), where Wij, {hj) ^ S 
are positive weights, X = (Xi, X2, . . . , X^r)"^ G R^^^ [TT] and usually S has many 
fewer than A^^/2 elements due to the error in the theoretical or experimental data 
[HI [18]. There may even not exist any solution Xi, X2, . . . , X^ to satisfy the distance 
constraints in dH), for example when data for atoms i,j, /c G S violate the triangle 
inequality; in this case, we may add a perturbation term —e^X to P{X)\ 

(V,) minP,(X) = E(ij)es ^ijiW^i - X,\\^ - r^-)^ - e^X, (6) 

where e > 0. Thus, the L-J potential optimization problem ([3]) is rewritten into the 
optimization problem ([6]). 

Problem ([6]) is just the minimization problem of sum of fourth-order polynomials, 
which can be elegantly solved by the canonical dual theory (CDT) in optimization 
[31 m [5]. We apply the above theory to an amyloid fibril molecular model building 
problem. The rest of this paper is arranged as follows. In the next section, i.e. Section 
2, the CDT will be briefly introduced and its effectiveness will be illuminated by ap- 
plying the CDT-based optimization approach to the famous double-well problem. In 
Section 3, the molecular model building works of prion AGAAAAGA amyloid fibrils 
will be done by the CDT. We find that just very slight refinement is needed by the 
optimization programs of computational chemistry package Amber 11 to get the op- 
timal molecular models. This implies to us the effectiveness of the CDT to solve our 
L-J potential minimization problem. Thus, when using the time-consuming and costly 
X-ray crystallography or NMR spectroscopy we still cannot determine the 3D structure 
of a protein, we may introduce computational approaches or novel mathematical for- 
mulations and physical concepts into molecular biology to study molecular structures. 
These concluding remarks will be made in the last section, i.e. Section 4. 



2 The Canonical dual optimization approach 

We briefly introduce the CDT of [3[[ll|5j specially for solving the following minimization 
problem of the sum of fourth-order polynomials: 



(V) : min J P(x) = V Wi(x) + -x^Qx - x^ f : x eW"} , 

^ \ U ^ J 

where Wi{x) = -ai (-x^AiX + hJx^cA ,Ai = AJ,Q = Q^ eR'''''', 

hi J ^W,Ci,ai GM\i = 1,2, ...,n,x G Af CM''. 
The dual problem of {V) is 

{V") : max {p%) = f] (c,q - \ai\^^ - \f^{^)G+{<;)F{,) : ? G 5 J , 



(7) 



(8) 



where F{<;) = / - J]^i6i,G(?) ^Q + Y,^iAi,Sa = {? G W^\F(s) e Col{G{<;))}, 



i=l 



G^ denotes the Moore-Penrose generalized inverse of G, and Col{G{(;)) is the column 
space of G{(;). The prime-dual Gao-Strang complementary function of CDT [SI HIE] is 



m 



i(x,0 -^^ 



1=1 






For (V) and (V^) we have the following CDT: 



+ ^x^Qx - x^f. (9) 



Theorem 1 Q^ The problem (V^) is canonically dual to (V) in the sense that if 
^ is a critical point of P^{q), then x — G^{^)F{^) is a critical point of P{x) on MJ^, and 
P{x) — P^(^). Moreover, if ^ ^ S^ = {^ G Sa\G{(^) y 0}, then ^ is a global maximizer 
of P^{<;) over S'^ , x is a global minimizer of P{x) on M^^ and 

P{x) = min P{x) = S(x,^) = maxP^((;) = P^(^). (10) 

It is easy to prove that the canonical dual function P^{<^) is concave on the convex dual 
feasible space S'^ . Therefore, Theorem 1 shows that the nonconvex primal problem {V) 
is equivalent to a concave maximization problem (V^) over a convex space 5^, which 
can be solved easily by well-developed methods. Over S~ — {<; ^ Sa\G{<;) -< 0} we have 
the following theorem: 

Theorem 2 /5/ Suppose that ^ is a critical point of (V^) and the vector x is defined 
by X — G^{^)F{q). If ^ ^ S~ , then on a neighborhood Xo x So C X x S~ of (x^^), we 
have either 

P{x) = min P{x) = S(x,^) = minP^((;) = P^(^), (11) 

or 

P{x) = maxP(x) = S(x,^) = maxP^((^) = P^(^). (12) 

By the fact that the canonical dual function is a d.c. function (difference of convex 
functions) on S~ ^ the double-min duality (ITT|) can be used for finding the biggest local 
minimizer of (V) and (P^), while the double-max duality (IT2]) can be used for finding 
the biggest local maximizer of (P) and (P^). In physics and material sciences, this pair 
of biggest local extremal points play important roles in phase transitions. 

To illuminate the CDT above-mentioned, we minimize the well-known Double Well 
potential function [Sj (blue colored in Fig. [2j): 

P{x) = \{\x^-2f-\x. (13) 

We can easily get S(x, <;") = (|x^ — 2)^ — |^^ — |x, 

^'W = -^-^?'-2? (14) 

(red colored in Fig. E]) and 5+ = {? G M}\<; > 0}. Let H(x, <;)' = 0, we get three critical 
points of S(a;,?): {x^,q^) = (2.11491, 0.236417), (x^,?^) = (-1.86081, -0.268701), (x^,?^) 




Figure 2: The prime and dual double- well functions (Prime: blue, Dual: red). 

(-0.254102,-1.96772). By Theorem 1, we know x^ = 2.11491 is the global mini- 
mizer of ([ED, ^^ = 0.236417 is the global maximizer of ([H]) over S^, and P{^^) = 
E{x^,^^) = P^{^^) — —1.02951. By Theorem 2, we know that the local minimizers: 
x2 = -1.86081,(^2 _ -0.268701 (over S'), P{^^) = E{x^,^^) = P^(^2) _ 0.9665031 and 
the local maximizers: x^ = -0.254102,(^3) = -1.96772 (over 5"), P(^3) = ^(x^,^^) = 
pd^f) = 2.063. 



3 Applications to a L-J potential optimization problem 

In 2007, Sawaya et al. got a breakthrough finding: the atomic structures of all amyloid 
fibrils revealed steric zippers, with strong vdw interactions (LJ) between /3-sheets and 
hydrogen bonds (HBs) to maintain the /3-strands [12j. Similarly as ([T]), i.e. the potential 
energy for the vdw interactions (Fig. [1]) between /3-sheets: 



VlAv) 



A_ 

Il2 



B 

^6 ' 



the potential energy for the HBs between the /3-strands has a similar formula 



(15) 



(16) 



where A, B, C, D are constants given. Thus, the amyloid fibril molecular model build- 
ing problem is reduced to well solve the optimization problem ([3]) or ([6]) (in this paper 
we apply the CDT introduced in Section 2 to solve ([6])). 



In this section, we will use suitable templates 3nvf.pdb, 3nvg.pdb and 3nvh.pdb 
from the Protein Data Bank ( http://www.rcsb. org/D to build some amyloid fibril mod- 
els. 



3.1 3NVF 

Constructions of the AG AAA AG A amyloid fibril molecular structures of prion 113- 
120 region are based on the most recently released experimental molecular structures 
of IIHFGS segment 138-143 from human prion (PDB entry 3NVF released into Protein 
Data Bank (http://www.rcsb.org') on 2011-03-02) [Ij. The atomic-resolution structure 
of this peptide is a steric zipper, with strong vdw interactions between /3-sheets and 
HBs to maintain the /3-strands (Fig. [3j). In Fig. [3] we see that H chain (i.e. /3-sheet 2) 




Figure 3: Protein fibril structure of IIHFGS segment 138-143 from human prion. The 
purple dashed lines denote the hydrogen bonds. A, B, ..., I, J denote the 10 chains of 
the fibrils. 



of 3NVF.pdb can be obtained from A chain (i.e. /3-sheet 1) by 

-1 \ / 27.546 

H = \ -1 L4+ 

1/ \ 

and other chains can be got by 

\ / 

C{G)^A{H)+\ ] ,B{F)^A{H) + 2\ 
4.8 / V 4.8 



(17) 



(18) 




D{I)^A{H)-\ ],E{J)^A{H)-2\ |. (19) 

4.8 

Basing on the template SNVF.pdb from the Protein Data Bank, three prion AGAAAAGA 
paUndrome amyloid fibril models - an AGAAAA model (3nvf-Model 1), a GAAAAG 
model (3nvf- Model 2), and an AAAAGA model (3nvf-Model 3) - will be successfully 
constructed in this paper. Chain A of 3nvf-Models 1-3 were got from A Chain of 
3NVF.pdb using the mutate module of the free package Swiss-Pdb Viewer (SPDBV 
Version 4.01) (http://sp dbv.vital-it.ch| ). It is pleasant to see that almost all the hydro- 
gen bonds are still kept after the mutations; thus we just need to consider the vdw 
contacts only. Making mutations for H Chain of 3NVF.pdb, we can get H Chain 
of 3nvf-Models 1-3. However, we find that the vdw contacts between A Chain and 
H Chain are too far at this moment. We know that for 3nvf- Model 1 at least the 
vdw interaction between A.GLY2.CA-H.GLY2.CA, A.ALA4.CB-H.GLY2.CA should 
be maintained, for 3nvf-Model 2 at least three vdw interactions between A.ALA4.CB- 
H.ALA2.CB, A.ALA2.CB-H.ALA2.CB, A.ALA2.CB-H.ALA4.CB should be maintained, 
and for 3nvf-Model 3 at least three vdw interactions between A.ALA2.CB-H.ALA2.CB, 
A.ALA2.CB-H.ALA4.CB, A.ALA4.CB-H.ALA2.CB should be maintained. Fixing the 
coordinates of A.GLY2.CA and A.ALA4.CB (two anchors) ((-10.919,-3.862,-1.487), 
(6.357,1.461,-1.905)) for 3nvf-Model 1, fixing the coordinates of A.ALA2.CB and A.ALA4.CB 
(two anchors) ((11.959,-2.844,-1.977), (6.357,1.461,-1.905)) for 3nvf-Models 2-3, letting 
d equal to the twice of the vdw radius of Carbon atom (i.e. d = 3.4 angstroms), and let- 
ting the coordinates of H.GLY2.CA of 3nvf-Model 1 (two sensors) and the coordinates 
of H.ALA2.CB and H.ALA4.CB of 3nvf-Models 2-3 (two sensors) be variables, we may 
get a simple MDGP with 3/6 variables and its dual with 2/3 variables for 3nvf-Model 
1: 

P.ixi) = ^ {{xn + 10.919)2 + (xi2 + 3.862)2 ^ ^^^^ ^ ^437^2 _ ^^2^2 ^ 

^ {{xu - 6.357)2 ^ ^^^^ _ j_4gj)2 ^ ^^^^ ^ j_9Q5)2 _ 3_42|2 _ 

(0.05x11 + 0.05x12 + 0.05x13), 



^n?i, ?2) = 124.7908^1 - 2?r + 34.615?2 - -?2 - 

0.05 - 21.838?i -I- 12.714?2 \ '^ / lOTJ ^ ^ 

0.05 - 7.724?i + 2.922^2 ^^^ 

0.05 - 2.974<;^i - 3.81?2 / \ 



2?i+2« 



0.05 - 21.838^1 + 12.714?2 
0.05 - 7.724?i + 2.922?2 
0.05 - 2.974?i - 3.81?2 

We can get a global maximal solution (70.1836,70.1812) for P^{^i,^2) and its corre- 
sponding local maximal solution to Pe(xi): 

X = (-2.28097, -1.20037, -1.69582). 



By Theorem 1 we know that x is a global minimal solution of Pe{xi). Thus we get 



H 



-1 


^ 


\ / 


-4.5619 





-1 


]a + 


-2.4009 





1 . 


1 \ 


0.0004 



(20) 



for 3nvf-Model 1, whose other chains can be got by (|T^ - (fT^ (Fig. S]). For 3nvf-Models 
2-3, similarly we may get a simple MDGP with 6 variables and its dual with 3 variables: 

P,ixi,X2) = ^ {{xu - 11.959)2 + (xi2 + 2.844)2 + (xis + 1.977)^ - 3.4^}^ + 
- {(X2i - 11.959)2 + (x22 + 2.844)2 + (x23 + 1.977)2 - 3.42}^ + 
I {(xii - 6.357)2 ^ (^^^2 _ ^^Q-^^2 ^ ^^^^ ^ ^9(35)2 _ ^^2^^ _ 
(0.05x11 + 0.05x12 + 0.05x13 + 0.05x2i + 0.05x22 + 0.05x23), 



1 



1 



iTfe,?2,?3) = 143.4545?i - -<;{ + 143.4545^2 - 7,4 + 34.6150^3 - 7^4 



( 0.05 + 23.9180?! + 12.7140^3 \ 
0.05 - 5.6880?i + 2.9220^3 
0.05 - 3.9540?i - 3.8100?3 

0.05 + 23.9180?2 

0.05 - 5.6880?2 

0.05 - 3.9540?2 



/ 



1 
2?i+2?3 




















1 














2<ri+2?3 








1 











2?i+2?3 











1 

2?2 




















1 

2?2 




















1 

2?2 



/ 0.05 + 23.9180^1 + 12.7140^3 \ 
0.05 - 5.6880?i + 2.9220?3 
0.05 - 3.9540?i - 3.8100^3 

0.05 + 23.9180?2 

0.05 - 5.6880?2 

0.05 - 3.9540?2 



We can get a global maximal solution (0.920088,0.0127286,0.921273) for Pf{s\S2S-i) 
and its corresponding local maximal solution to Pg(xi,X2): 

X = (9.16977, -0.676538, -1.9274, 13.9231, -0.879925, -0.0129248). 
By Theorem 1 we know that x is a global minimal solution of Pe(xi,X2). Thus we get 



H 



-1 


^ 


\ / 


' 20.8459 





-1 


\a^[ 


-2.1533 





1 y 


1 \ 


, 0.6638 



(21) 



for 3nvg-Models 2-3, whose other chains can be got by (IT8]l - (IT9]l (Fig. [4]). 



We find 3nvf-Model 1 has some atoms with bad/close contacts, 3nvf-Model 2 has 
6 bad/close contacts, and 3nvf-Model 3 has no bad/close contact. This means it not 
necessary at ah to further refine 3nvf-Model 3. We remove these bad contacts by per- 
forming energy minimization using Amber 11 ^. Even if there are no obvious bad 




Figure 4: Protein fibril structure of 3nvf-Models 1-3 (from left to right respectively) for 
prion AGAAAAGA segment 113-120 . The purple dashed lines denote the hydrogen 
bonds. A, B, ..., I, J denote the 10 chains of the fibrils. 

contacts, it is still a good idea to run a short energy minimization to relax the struc- 
tures a bit. We will perform the energy minimization in 2 stages. In the first stage, 
we'll only minimize the water molecules and hold the protein fixed for 500 steps of 
steepest descent method and then 500 steps of conjugate gradient method. Our goal 
is just to remove bad contacts, there is no need to go overboard with minimization. 
In the second stage we proceed directly to minimizing the entire system as a whole 
for 1500 steps of steepest descent method and then 1000 steps of conjugate gradient 
method. RMSD (root mean square deviation) is an indicator for structural changes in 
a protein. It is used to measure the scalar spatial distance between atoms of the same 
type (for example the Cq, atoms) for two structures in different time. The RMSDs 
between the last snapshot after the refinement and the snapshot illuminated in Fig. H] 
are 2.15796, 1.3089087, 1.045318 angstroms for these three 3nvf-Models respectively. 
The very small values of RMSD are very good measure of precision of CDT for our 
model building. This shows us that CDT performs well. 



3.2 3NVG 

In this subsection, the constructions of the AGAAAAGA amyloid fibril molecular 
structures of prion 113-120 region are based on the most recently released experi- 
mental molecular structures of MIHFGN segment 137-142 from mouse prion (PDB 



entry 3NVG released into Protein Data Bank ( |http://ww w.rcsb.org|) on 2011-03-02) 
[1]. The atomic-resolution structure of this peptide is a steric zipper, with strong vdw 
interactions between /3-sheets and HBs to maintain the /3-strands (Fig. [5]). In Fig. [5] 
we see that H Chain (i.e. /3-sheet 2) of 3NVG.pdb can be obtained from A Chain (i.e. 
;5-sheet 1) by 

-1 \ / -27.28 

H = \ 1 h4+ 2.385 | , (22) 

0-1/ \ 15.738 




Figure 5: Protein fibril structure of MIHFGN segment 137-142 from mouse prion. The 
purple dashed lines denote the hydrogen bonds. A, B, ..., I, J denote the 10 chains of 
the fibril. 

and other chains can be got by 

\ / 

C{G) = A{H) + I 4.77 ,B{F) = A{H) + 2 4.77 | , (23) 



Dil) = AiH) 



4.77 


,E{J)-- 


= A{H) - 2 


4.77 









. 



(24) 



Basing on the template SNVG.pdb from the Protein Data Bank, three prion AGAAAAGA 
palindrome amyloid fibril models - an AGAAAA model (3nvg-Model 1), a GAAAAG 
model (3nvg-Model 2), and an AAAAGA model (3nvg-Model 3) - will be successfully 
constructed in this paper. Chain A of 3nvg-Models 1-3 were got from A Chain of 
3NVG.pdb using the mutate module of the free package Swiss-Pdb Viewer (SPDBV 
Version 4.01) ([http: //spdbv.vital-it.chD . It is pleasant to see that almost all the hydrogen 
bonds are still kept after the mutations; thus we just need to consider the vdw contacts 



10 



only. Making mutations for H Chain of SNVG.pdb, we can get the H Chains of 3nvg- 
Models 1-3. However, the vdw contacts between A Chain and H Chain are too far at 
this moment. We know that for 3nvg- Model 1 at least the three vdw interaction be- 
tween A.GLY2.CA-H.GLY2.CA, A.GLY2.CA-H.ALA4.CB, A.ALA4.CB-H.GLY2.CA 
should be maintained, for 3nvg-Model 2 at least the three vdw interactions between 
A.ALA2.CB-H.ALA2.CB, A.ALA2.CB-H.ALA4.CB, A.ALA4.CB-H.ALA2.CB should 
be maintained, and for 3nvg-Model 3 at least the three vdw interactions between 
A.ALA2.CB-H.ALA2.CB, A.ALA2.CB-H.ALA4.CB, A.ALA4.CB-H.ALA2.CB should 
be maintained. Fixing the coordinates of A.GLY2.CA and A.ALA4.CB (two an- 
chors) ((-11.159,-2.241,4.126), (-5.865,-2.618,8.696)) for 3nvg-Model 1, fixing the coordi- 
nates of A.ALA2.CB and A.ALA4.CB (two anchors) ((-12.040,-2.675,5.307), (-5.865,- 
2.618,8.696)) for 3nvg-Models 1-2, letting d equal to the twice of the vdw radius of 
Carbon atom (i.e. d = 3.4 angstroms), and letting the coordinates of H.GLY2.CA 
and H.ALA4.CB of 3nvg-Model 1 (two sensors) and the coordinates of H.ALA2.CB 
and H.ALA4.CB of 3nvg-Models 2-3 (two sensors) be variables, we may get a simple 
MDGP with 6 variables and its dual with 3 variables for 3nvg- Model 1: 

Pe{xi,X2) = ^ {{xu + 11.159)2 + (xi2 + 2.241)2 ^ (^^^ _ 4_j26)2 - 3.4^}^ + 
^ {(X2i + 11.159)2 + (x22 + 2.241)2 ^ ^^^^ _ 4_;^26)2 - 3.42}^ + 
- {(xii + 5.865)2 ^ ^^^^ ^ 2.618)2 + (xis - 8.696)2 _ ^^2^^ _ 
(0.05x11 + 0.05x12 + 0.05x13 + 0.05x2i + 0.05x22 + 0.05x23), 

Pf{<;i,<^2,<^3) = 135.009238?i - ^qf + 135.009238?2 - J?| + 105.3125?3 - J?| - 



/ 0.05 - 22.318?! - 11.73?3 \ 

0.05 - 4.482?i - 5.236>;^3 

0.05 + 8.252?i + 17.392?3 

0.05 - 22.318?2 

0.05 - 4.482?2 

0.05 + 8.252?2 



/ 



2?i+2?3 




















1 














2?i+2?3 








1 











2?i+2?3 











1 

2q 




















1 

2?2 




















1 

2?2 



/ 0.05 - 22.318?i - 11.73?3 \ 

0.05 - 4.482?i - 5.236>;^3 

0.05 + 8.252>;^i + 17.392>;^3 

0.05 - 22.318>;^2 

0.05 - 4.482>;^2 

0.05 + 8.252>;-2 



We can get a global maximal solution (0.708403,0.0127287,0.699001) for P/(?i,?2,?3) 
and its corresponding local maximal solution to Pe{xi,X2): 

X = (-8.51192, -2.41048, 6.4135, -9.19493, -0.276929, 6.09007). 
By Theorem 1 we know that x is a global minimal solution of Pe{xi,X2). Thus we get 



H^ 



-1 





^ 


\ / 


-18.133923 





1 





]a+1 


0.6673703 








-1 


1 \ 


11.955023 



(25) 



11 



for 3nvg-Model 1, whose other chains can be got by (l23]) - (f2l|) (Fig. E]). For 3nvg- Models 
2-3, similarly we may get a simple MDGP with 6 variables and its dual with 3 variables: 
1 



Pe{xi,X2) 



{{xu + 12.040)2 + (xi2 + 2.675)2 + (xis - 5.307)^ - 3.4^} + 



- {(X2i + 12.040)2 + (x22 + 2.675)2 + (x23 - 5.307)2 _ ^^2^ ^ 

- {(xii + 5.865)2 ^ ^^^^ ^ 2.618)2 + {xn - 8.696)2 _ ^^2^ _ 
(0.05x11 + 0.05x12 + 0.05x13 + 0.05x2i + 0.05x22 + 0.05x23), 



1 



1 



Pe (?i,<^2,?3) = 168.721474^1 - -qf + 168.721474.^2 - -?2 + 105.312565^3 - -?3 



/ 0.05 - 24.080^1 - 11.73?3 \ 

0.05 - 5.35?i - 5.236?3 

0.05 + 10.614?i + 17.392?3 

0.05 - 24.080?2 

0.05 - 5.35?2 
0.05 + 10.614?2 



V 

/ 0.05 - 24.080?i - 11.73?3 \ 

0.05 - 5.35?i - 5.236?3 

0.05 + 10.614^1 + 17.392?3 

0.05 - 24.080^2 

0.05 - 5.35^2 

0.05 + 10.614>;^2 



/ 



1 
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2?i+2?3 
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2?i+2?3 
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2<ri+2<r3 
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2?2 






















1 

2?2 






















1 


/ 



We can get a global maximal solution (0.849735,0.0127287,0.84036) for Pf{<^i,<^2,<^3) 
and its corresponding local maximal solution to Pe(xi,X2): 

X = (-8.95484, -2.63187, 7.00689, -10.0759, -0.710929, 7.27107). 
By Theorem 1 we know that x is a global minimal solution of Pe(xi,X2). Thus we get 

/ -1 \ / -19.3102 \ 

H= i 1 M + 0.6644 . (26) 

\ -1 / V 13.5316 / 

for 3nvg-Models 2-3, whose other chains can be got by (I23]l - (l24]l (Fig. [6]). 

We did same refinements for 3nvg-Models 1-3 as for 3nvf-Models 1-3. The RMSDs 
between the last snapshot after the refinement and the snapshot illuminated in Fig. [6] 
are 1.572438, 1.404648, 1.464767 angstroms for these three 3nvg-Models respectively. 
The very small values of RMSD show us that CDT performs well and precisely for 
3nvg-Model building. 



3.3 3NVH 

Similar as the above two subsections, this subsection constructs the AGAAAAGA amy- 
loid fibril molecular structures of prion 113-120 region basing on the most recently re- 
leased experimental molecular structures of MIHFGND segment 137-143 from mouse 
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Figure 6: Protein fibril structure of 3nvg-Models 1-3 (from left to right respectively) for 
prion AGAAAAGA segment 113-120 . The purple dashed lines denote the hydrogen 
bonds. A, B, ..., I, J denote the 10 chains of the fibrils. 



prion (PDB entry 3NVH released into Protein Data Bank (http://www.rcsb.org) on 
2011-03-02) [Ij. The atomic-resolution structure of this peptide is a steric zipper, with 
strong vdw interactions between /3-sheets and HBs to maintain the ;5-strands (Fig. [7j). 
In Fig. [3 we see that H chain (i.e. /3-sheet 2) of 3NVH.pdb can be obtained from A 
chain (i.e. /3-sheet 1) by 



H 



and other chains can be got by 



-1 


^ 


\ / 





1 





u+ 


2.437 





-1 


/ \ 


-15.553 



\ / 

C{G)^A{H)+\ 4.87 \,B{F)^A{H) + 2\ 4.87 



(27) 



(28) 



D{I) = A{H) 





4.87 




,E{J)^A{H)-2 





4.87 




(29) 



Basing on the template 3NVH.pdb from the Protein Data Bank, three prion AGAAAAGA 
palindrome amyloid fibril models - an AGAAAAG model (3nvh-Model 1), a GAAAAGA 
model (3nvh-Model 2) - will be successfully constructed in this paper. A chain of 3nvh- 
Models 1-2 were got from A chain of 3NVH.pdb using the mutate module of the free 
package Swiss-Pdb Viewer (SPDBV Version 4.01) (http:/7spdbv.vital-it.ch). It is pleas- 
ant to see that almost all the hydrogen bonds are still kept after the mutations; thus 
we just need to consider the vdw contacts only. Making mutations for H chain of 
3NVH.pdb, we can get the H chains of 3nvh-Models 1-2. However, the vdw contacts 
between A chain and H chain are too far at this moment (>4.25 Angstroms), we 
may know that for 3nvh-Models 1-2 at least one vdw interaction between A.ALA4.CB- 
II.ALA4.CB should be maintained. Fixing the coordinates of A.ALA4.CB (the anchor) 
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Figure 7: Protein fibril structure of MIHFGND segment 137-143 from mouse prion. 
The purple dashed lines denote the hydrogen bonds. A, B, ..., I, J denote the 10 chains 
of the fibril. 

((1.731,-1.514,-7.980)), letting d equal to the twice of the vdw radius of Carbon atom 
(i.e. d = 3.4 angstroms), and letting the coordinate of H.ALA4.CB (one sensor) be 
variables, we may get a simple MDGP with 3 variables and its dual with 1 variable: 



p^^i) 



P^{xi) = - {(xii - 1.731)2 + (xi2 + 1.514)2 + (xi3 + 7.980)2 _ ^^2^2 

-0.05x11 - 0.05x12 - 0.05x13, 

57.409?i - ^?f 

(0.05 + 3.462^i)2 + (0.05 - 3.028?i)2 + (0.05 - 15.96?i)2 

4^^ ■ 

We can easily get the global maximal solution 0.0127287 G {? G IR^Iq > 0, i = 1} for 
Pf(z,\). Then, we get its corresponding solution for Pg(xi): 

X = (3.69507, 0.450071, -6.01593). 
By Theorem 1 we know that x is a global minimal solution of ^^(xi), i.e. for H.ALA4.CB. 
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Thus we get 



H 



-1 











1 











— ' 



A + 



5.42607 

1.964071 

-13.99593 



(30) 



for 3nvh-Models 1-2, whose other chains can be got by f[28]) - (l29l) (Fig. [8]). 




Figure 8: Protein fibril structure of 3nvh-Models 1-2 (from left to right respectively) for 
prion AGAAAAGA segment 113-120 . The purple dashed lines denote the hydrogen 
bonds. A, B, ..., I, J, K, L denote the 12 chains of the fibrils. 



We carried on the same refinements for 3nvh-Models 1-2 as for 3nvf-Models 1-3 
and 3nvg-Models 1-3. The RMSDs between the last snapshot after the refinement 
and the snapshot illuminated in Fig. [8] are 1.534417, 1.572836 angstroms for the two 
3nvh-Models respectively. The very small values of RMSD again show to us that CDT 
performs well and precisely for 3nvg-Model building. 



3.4 Refined 3nvf-Models 1-3, 3nvg-Models 1-3, 3nvh-Models 1-2 

The amyloid fibril models of prion AGAAAAGA segment refined by Amber 11 are illu- 
minated in Fig.s [9lim All these models are without any bad contact now (checked by 
package Swiss-Pdb Viewer) , and the vdw interactions between the two /3-sheets are in 
a very perfect way now. All the initial structures before dealt by CDT approach have 
very far vdw contacts between the two /3-sheets. CDT easily made the vdw contacts 
come closer and reach a state with the lowest potential energy, which has perfect vdw 
contacts as shown in Fig.s [QllTTl 



4 Conclusion 

Global optimization of Lennard- Jones clusters is a challenging problem for researchers 
in the field of biology, physics, chemistry, computer science, materials science, and es- 



15 






Figure 9: Perfect 3nvf-Models 1-3 (from left to right respectively) for prion 
AGAAAAGA segment 113-120 . The purple dashed lines denote the hydrogen bonds. 
A, B, ..., I, J denote the 10 chains of the fibrils. 






Figure 10: Perfect 3nvg-Models 1-3 (from left to right respectively) for prion 
AGAAAAGA segment 113-120 . The purple dashed lines denote the hydrogen bonds. 
A, B, ..., I, J denote the 10 chains of the fibrils. 








Figure 11: Perfect 3nvh-Models 1-2 (from left to right respectively) for prion 
AGAAAAGA segment 113-120 . The purple dashed lines denote the hydrogen bonds. 
A, B, ..., I, J, K, L denote the 12 chains of the fibrils. 
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pecially for experts in matheraatical optimization research field because of the noncon- 
vexity of the L- J potential energy function and enormous local minima on the potential 
energy surface. In March 2008, American Mathematical Programming Society spe- 
cially produced one whole issue, No. 76, to discuss this problem. In this paper through 
clever use of global optimization techniques of Gao's canonical dual theory (CDT), we 
successfully tackle this challenging problem illuminated by the amyloid fibril molecu- 
lar model building. Clearly, this paper shows to readers that CDT is very useful and 
powerful to tackle challenging problems in optimization area and many other areas. 
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